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Abstract 

Patrolling is one of the central problems in operational security. Formally, a patrolling problem is 
specified by a set U of nodes (admissible defender’s positions), a set T c (J of vulnerable targets, an 
environment E c U x U (admissible defender’s moves), and a function d which to every target u assigns 
the time d(u ) 6 N needed to complete an intrusion at u. The goal is to design an optimal strategy for a 
defender who is moving from node to node and aims at detecting possible intrusions at the targets. The 
defender can detect an intrusion at a target u only by visiting u before the intrusion is completed. The 
goal of the attacker is to maximize the probability of a successful attack, and the defender aims at the 
opposite. We assume that the attacker is adversarial, i.e., he knows the strategy of the defender and can 
observe her moves. 

We prove that the defender has an optimal strategy for every patrolling problem. Further, we show 
that for every s > 0, there exists a finite-memory s-optimal strategy for the defender constructible in 
exponential time (in the size of the game), and we observe that such a strategy cannot be computed in 
polynomial time unless P = NP. 

Since (sub)optimal strategy synthesis is computationally hard for patrolling problems in general 
environments, we continue our study by restricting ourselves to fully connected environments where 
E - U x U (where we can safely assume that T = U). Then, a patrolling problem is fully determined 
by its signature, i.e., a function S such that S ( k ) is the total number of targets with attack length equal 
to k. We assume that S is encoded by using binary numbers, i.e., the encoding size of S can be expo¬ 
nentially smaller than the number of targets. We start by establishing an upper bound on the value of a 
given patrolling problem, i.e., we bound the maximal probability of successfully defended attacks that 
can be achieved by the defender against an arbitrary strategy of the attacker. The bound is valid for an 
arbitrary patrolling problem such that T = U and depends only on the signature S. Then, we introduce a 
decomposition method which allows to split a given patrolling problem Q into smaller subproblems and 
construct a defender’s strategy for Q by “composing” the strategies constructed for these subproblems. 
Using this method, we can synthesize (sub)optimal defender’s strategies in time which is proportional to 
the encoding size of S. Consequently, we can compute (sub)optimal strategies for exponentially larger 
patrolling problems then the existing methods based on mathematical programming, where the size of 
the programs is proportional to the number of targets. Finally, for patrolling problems with T — U and 
a well-formed signature, i.e., a signature S such that k divides S (k ) for every k e N, we give an exact 
classification of all sufficiently connected environments where the defender can achieve the same value 
as in the fully connected uniform environment. This result is useful for designing “good” environments 
where the defender can act optimally. 
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1 Introduction 


A central problem in security and operational research is how to deploy limited security resources (such 
as police patrols, security guards, etc.) to maximize their effectiveness. Clearly, police patrols cannot 
be everywhere all the time, security guards cannot check every door every minute, etc., which raises a 
crucial question how to utilize them best. Game theoretic approaches to operational security problems 
based on Stackelberg model have received much attention in recent years (see, e.g., 1241 '). Informally, the 
problem is to find the best possible strategy for a defender who is supervising potentially vulnerable targets 
(such as airports, banks, or patrol stations) and aims at detecting possible intrusions. The time needed to 
complete an intrusion at each target is finite, and the aim of the defender is to maximize the probability 
of discovering an intrusion before it is completed. An intensive research in this area has led to numerous 
successful applications (see, e.g., EH El). Due to high demand for practically usable solutions, the main 
emphasis has been put on inventing methods that can produce working solutions for large-scale instances 
quickly. In most cases, the problem is simplified (for example, by restricting the set of defender’s strategies 
to some manageable subclass), and various tricks are used to avoid non-linear constraints and/or objectives. 
This approach enables efficient synthesis of strategies that are “good enough” for practical purposes (thus, 
the main engineering goal is achieved), but does not allow for synthesizing optimal or e-strategies (for a 
given e > 0) in general. Further, the size of the resulting mathematical program is usually proportional 
to the number of targets, which influences the scalability of these methods. Since developing the basic 
theory of the underlying game model has not received so much attention as designing practically usable 
solutions, many fundamental questions (such as the computability of the Strackelberg value, the existence 
and computability of an optimal/e-optimal defender’s strategy, etc.) are open or have even been answered 
incorrectly. In this paper, we provide a solution for some of these problems. As an unexpected payoff of 
our study, we also obtain a completely new approach to synthesizing defender’s strategies in security games 
with fully connected environment based on compositional reasoning, which avoids the use of mathematical 
programming and can be applied to exponentially larger instances than the currently available methods. A 
detailed explanation of the achieved results is given below. 

In this paper, we consider the adversarial variant of patrolling, where the attacker is assumed to be quite 
powerful—he can observe defender’s moves, and he even knows defender’s strategy. However, he cannot 
predict the way of resolving the defender’s randomized choice. Formally, a patrolling problem Q is specified 
by a finite set U of nodes (possible defender’s positions), a set T c U of targets, an initial node u e T (the 
initial position of the defender), an environment E c U x U (admissible moves of the defender) and a 
function d : T —> N which to every target associates the corresponding attack length. The defender starts 
at u and then moves from node to node consistently with E. We assume that traversing every edge takes 
precisely one unit of time (longer moves can be modeled by inserting intermediate nodes.) The defender 
may choose the next node randomly and independently of her previous choices. Formally, a defender’s 
strategy is a function cr : E( —> A(U) where 'H is the set of all finite non-empty sequences of nodes and 
A (U) is the set of all probability distributions over U. We require that cr is consistent with E, i.e., the support 
of crili) is a subset of nodes that are immediate successors of the last node of h. Note that each cr determines 
a unique probability space over all runs (infinite paths in ( U,E )) initiated in u in the standard way, and we 
use P ,T to denote the associated probability measure. 

Depending on the observed walk of the defender, the attacker may choose to attack some target or 
wait (we assume that the attacker may attack at most once during a play). More precisely, an attacker’s 
strategy is function n : 'H —> T U {J.} such that whenever 7t(h) + _L, then for all proper prefixes h! of h 
we have that t t{h') = _L. Since the attacker has a complete knowledge about the current position of the 
defender, he would never attack a target currently visited by the defender. Still, he may attack this target 
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Figure 1: Two examples of patrolling problems and the corresponding optimal defender’s strategies. 


immediately after the defender’s departure, i.e., long before the defender arrives to the next node (think of 
an UAV patrolling military bases). This assumption is reflected in the definition of a discovered attack—if 
the current location of the defender is u and the attacker attacks a target v, the defender has to visit the 
node v within the next d(v) time units to discover this attack, even if u = v. The aim of the defender is to 
maximize the probability of successfully detected (or not initiated) attacks, while the attacker aims at the 
opposite. Given a strategy cr of the defender and a strategy n of the attacker, we use V T CD\n\) to denote 
the probability of all infinite paths w initiated in u such that either n(h) = J_ for every prefix h of w (i.e., no 
attack is encountered along w), or n(h) = v e T for some prefix li of w and v is among the nodes visited after 
li in w in at most d(v) transitions (i.e., w contains a successfully defended attack). The value ofcr is defined 
by val{cr) = inl( T P ,t CD\j:\), where n ranges over all strategies of the attacker. The Stackelberg value of Q is 
defined by val = sup cr val(cr), where cr ranges over all strategies of the defender. A defender’s strategy cr* 
is e-optimal (where e > 0) if val{cr*) > val - s. A 0-optimal strategy is called optimal. 

Remark 1.1. In our definition of the patrolling problem, we assume that all targets are equally important 
to the defender (and the attacker). The results A and B presen ted below remain valid even if we extend the 
model by assigning numerical weights to nodes and modify the game objective so that the defender/attacker 
aims at maximizing/minimizing the expected weight of a discovered attack. If the weight (importance) of 
nodes is different for each player, the game is no longer zero-sum, and the solution concept becomes some¬ 
what different (consequently, our results do not apply in this case). 

Two simple examples. To get some intuition about the patrolling problem, we start with two simple exam¬ 
ples that will also be used to demonstrate some of our results. Let us first consider the patrolling problem 
of Fig. [H (left). Here, we need to patrol three nodes with the same attack length 2 (i.e., T = U), where liq 
is the initial node, in a fully connected environment. Let us try to determine the Stackelberg value and an 
optimal strategy of the defender. A naive idea is to pick a strategy cr which always selects each of the three 
immediate successors with probability 1/3. Consider a strategy n of the attacker such that n(uo) = ui. We 
have that r P <T (£)\nX) - 1/3 + 2/3 • 1/3 = 5/9, and one can easily verify that for every attacker’s strategy n' 
we have that r f yT CD\n' ]) > 5/9. Hence, val > val(cr) = 5/9. However, the defender can do better. Consider 
the strategy cr* defined in Fig.|T](left). Observe that cr* is independent of the currently visited node; the only 
relevant information about the history of a play is whether its length is even or odd. If it is even (odd), then 
cr* randomly selects between uq and ui (or between ii\ and uf) where the ratio between the two probabilities 
is the golden ratio. One can check that for every defender’s strategy n we have that P (T (D\n\) > (V5 - l)/2. 
Hence, val > (V5 - l)/2 > 5/9. In fact, the strategy cr* is optimal, i.e., val = (V5 - l)/2, which is perhaps 
unexpected (see also the paragraph “ Comments on D” below). 
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Now consider the patrolling problem of Fig. Q] (right). Flere we need to patrol five nodes (T = U)\ 
two of them have the attack length 2 and three of them have the attack length 3. Again, we assume a fully 
connected environment. If we examine a naive strategy cr which always selects the next node uniformly 
among all immediate successors, we obtain that val{cr) = 9/25. A better strategy cr* for the defender 
is shown in Fig. Q] (right). The strategy cr* depends only on the length of the history modulo 6, and it 
always chooses uniformly between exactly two nodes. It directly follows from our subsequent contributions 
(namely C) that val = val{cr*) = 1/2, i.e., cr* is optimal and the Stackelberg value is equal to 1/2. 

Our contribution. We start by proving the following results about the general patrolling problem: 

A. For an arbitrary patrolling problem, there exists an optimal strategy for the defender. 

B. Given a patrolling problem Q = ( U,T , u,E,d) and a rational e > 0, there is a finite-memory £-optimal 
strategy cr for the defender computable in time exponential in \\Q\\ and polynomial in e _1 (here, \\Q\\ is 
the encoding size of Q , where the attack lengths are encoded in unary). Further, valuer) is rational and 
can also be computed in exponential time, i.e., we can also approximate val up to a given e > 0 in 
exponential time. We also observe that val cannot be approximated up to the error smaller than |f/| -1 in 
polynomial time unless P = NP. 

Comments on A. The existence of optimal strategies for patrolling problems (and then - variants) has been 
claimed in previous works (see, e.g., El [5]) by arguing in the following way. For each j e N, let I 7 ' be 
the class of all defender’s strategies cr such that cr{h) depends only on the last j nodes of h. If we restrict 
the range of cr to the strategies of I 7 in the definition of Stackelberg value, we obtain an approximated 
value, denoted by val 1 . Obviously, vn/ 7+1 > vaV for every j 6 N. By adapting the results of |[T3l . it has 
been shown in f5j that for every j e N one can compute a strategy cr e X 7 which achieves the outcome 
vaV or better against every attacker’s strategy. In El 13], it has been also claimed that val = vaV for some 
sufficiently large j (without providing any upper bound). The argument is based on applying general results 
about strategic-form games, but a full proof is omitted. Using the techniques of Section [231 we prove that 
this claim is incorrect, even for the simple patrolling problem of Fig. Q] (right) where the defender has no 
optimal strategy in \JJ =l X 7 . In our proof of A, we take an infinite sequence of strategies cr\ , o~ 2 , ... such 
that lim (( ^oo val{cr n ) = val and “extract” and optimal strategy out of it. 

Comments on B. Our exponential-time algorithm for constructing an e-optimal strategy is based on combin¬ 
ing two main ideas. First, we show that the Stackelberg value of a given game stays the same when the initial 
target is changed. This implies that small perturbations in probability distributions employed by an optimal 
strategy cause only a small change in the strategy value. Hence, we can compute a suitable discretization 
scale and safely restrict the range of considered strategies to the discretized probability distributions. Let 
d = max„ S £/{<jf(«)}. The next important observation is that the d-step behaviour of every strategy (after some 
finite history) can be fully characterized by a real-valued vector with exponentially many components, where 
each component corresponds to a probability of visiting some vertex in at most k < d transitions. Due to 
the previous discretization step, we can safely restrict the range of these vectors to finitely (exponentially) 
many values. It follows that if there is some e-optimal strategy, then there is also an e-optimal strategy 
whose d- step behaviour (after every finite history) can be characterized by one of these exponentially many 
vectors, and we show how to check the existence of such a strategy in exponential time (this is perhaps the 
most difficult part of the argument). 

The lower complexity bound is trivial. Given a patrolling problem with d(u) = \U\ = k for all u £ U, we 
have that val = 1 iff the environment contains a directed cycle through all the nodes (i.e., it is a Hamiltonian 
digraph), which is NP-hard to decide. If the game is a negative instance, then for every strategy of the 
defender, the attacker clearly can launch an attack at the very beginning of a play with probability of success 
at least 1 /k. From this we immediately obtain the second part of B. Although in recent l20l . it is shown that 
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the problem whether val = 1 for a given patrolling problem is PSPACE-complete, the construction of ff20l 
only (for principal reasons) rules out, unless P = PSPACE, the existence of an e-optimal strategy for the 
defender with s < c ■ exp (—1£/|) for some c > 0. 

Since solving general patrolling problems is computationally hard, we continue our study by restricting 
ourselves to fully connected environments, where E = UxU. Observe that the defender has no reason to visit 
non-target nodes in fully connected environments, and hence we can further safely assume that T = U. For 
example, think of a surveillance system equipped with several cameras installed in front of various doors, 
where the footage of the cameras is shown in turns on a single screen (for some small constant amount of 
time) watched by a human guard. The time needed to break (open and close) different doors can be different. 
Then, the nodes/targets of the associated patrolling problem correspond to the cameras, the environment is 
fully connected (assuming one can switch between the cameras freely), and the transition time between two 
nodes is the same (and it can be normalized to 1). Under these assumptions, a patrolling problem is fully 
specified by its signature, i.e., a function S : N — > No which for a given k e N returns the number of all 
u € T with d(u) ~ k. An important subclass of signatures are well-formed signatures, where k divides S (k) 
for all k 6 N. For example, the signature of the patrolling problem of Fig.[T|(right) is well-formed, while the 
signature of the patrolling problem of Fig. [H (left) is not. We assume that signatures arc represented using 
binary numbers, i.e., the encoding size of S , denoted by ||S||, can be exponentially smaller than the number 
of nodes. 

Before formulating our results about the patrolling problem in a fully connected environment, we need 
to explain one important conceptual contribution of this paper, which is the notion of a modular strategy 
and the associated compositionality principle. A defender’s strategy cr is modular if cr(h) depends only on 
the length of h modulo some constant c (in particular, note that the current defender’s position is irrelevant). 
For example, the two strategies of Fig. [H are modular (the constant c is equal to 2 and 6 for the strategy 
on the left and on the right, respectively). Let Q be a patrolling problem with a set of nodes U. For every 
U' c U, let Q\ IJ' ] be the patrolling problem obtained from Q by restricting the set of nodes to U' and the set 
of transitions to E OU'xU' (note that this makes sense even if the environment of Q is not fully connected). 
Let U i,. . ., £4 c U, and let cr\, ... , 0 "^ be modular defender’s strategies in Q\ U\ |, . .. , Q\ t/& |, respectively. 
For every probability distribution v over {1, we can construct the v-composition of <xi,..., c t>, which 

is a modular defender’s strategy cr in Q[U i U- • • U C/*] defined by cr{h) = v\ -cri(h) + - • ■ + vy • (rf h). Note that 
cr is a correctly defined defender’s strategy for Q\U\ U • • • U Uk\ only if the environment of Q contains all of 
the required transitions between the nodes of U \,..., £4 (if the environment of Q is fully connected, this is 
no issue). It follows immediately that val{cr) > minj v/ • va/(cr ( ) | 1 < i < k) (as we shall see, this inequality 
can be strict ). Thus, one can construct a defender’s strategy for a given patrolling problem Q by splitting the 
set of nodes into two or more subsets (not necessarily disjoint), solving the smaller instances recursively, 
and then computing a suitable convex combination of the solutions. As we shall see momentarily, this 
approach leads to an efficient algorithm capable of computing optimal (or suboptimal) strategies for very 
large patrolling problems in couple of seconds. 

Now we can explain our main results about the patrolling problem in a fully connected environment. 

C. Given a patrolling problem Q where T = U, we have that val < {^kesuppts) Hr) where S is the sig¬ 
nature of Q and supp{S) is the set of all k e N such that S (k) > 0. This bound is valid for an arbitrary 
environment E. 

D. There is an algorithm which inputs a signature S of a patrolling problem Q with a fully connected 
environment (where T = U) and outputs a pair {6, V) such that the following conditions are satisfied: 

- The running time of the algorithm in polynomial in ||S||. 
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- 8 is a symbolic representation of a modular strategy for Q , and V is a symbolic representation of 
val{6). Both 8 and V are parameterized by variables {p i,..., pk), where k is bounded by a polynomial 
in ||5 ||. The values of {p\,..., pk } correspond to the unique solution (in [0,1]*) of a recursive system 
of polynomial equations that is also constructed by the algorithm. The number of variables k actually 
depends on the “Euclid complexity” of 5 and can be constant (or even zero) for arbitrarily large S. 

- If the signature 5 is well-formed, then k = 0 and the strategy 8 is optimal. Since k = 0, no extra com¬ 
putational time is needed to calculate/approximate the parameters, and hence 8 is “fully synthesized” 
in time polynomial in ||5||. 

- If the signature 5 is not well-formed, then the strategy 8 is a v-composition of simpler modular 
strategies and the variables defined via the system of polynomial equations correspond to the weights 
used to combine these simpler strategies together. Further, we have that val d < val{8) < val u , where 
val d and val u are the Stackelberg values of the patrolling problems with signatures S d and S u defined 
by S d (k) = k- LfJ and S u (k) = k-[ fj, respectively. 

E. Given a patrolling problem Q with T = U and a well-formed attack signature 5, we say that the environ¬ 
ment E of Q is sufficiently connected if val is equal to the value of Q in the, fully connected environment. 
The problem whether E is sufficiently connected is NP-complete. Further, this problem is NP-complete 
even for a subclass of patrolling problems such that supp{S) = {kj, where k > 3 is a fixed constant. For 
a subclass of patrolling problems where supp(S) = {2}, the problem is solvable in polynomial time. 

Comments on C. Note that the presented upper bound on val does not depend on E. An obvious ques¬ 
tion is whether this bound is tight. That is, given a function S : N —> No such that supp(S) is finite, 
we ask whether there exists a patrolling problem Q with T = U such that the signature of Q is 5 and 
val = l/(£jkesupp(S) S(k)/k). It follows from our results that the answer to this question is yes if 5 is well 
formed. This means that the bound can be potentially lowered (only) for those 5 that are not well formed. 
As an example, consider the patrolling problem of Fig. Q] (left). Here supp(S) = {2} and 5(2) = 3, and 
hence we obtain val < 2/3. Since val = (V5 - l)/2 < 2/3, the bound is not tight. For the patrolling 
problem of Fig. Q] (right) we have that supp{S) = {2, 3), 5(2) = 2, and 5(3) = 3, which gives an upper bound 
(2/2 + 3/3) -1 = 1/2. Since val = 1/2, this bound is tight. 

Comments on D. The strategy 8 is obtained by applying the “decomposition” technique described earlier. 
Since we intend to produce a strategy synthesis algorithm whose running time is polynomial in ||S||, we 
also need to design a special language allowing for compact representation of modular strategies in space 
polynomial in ||S| (see Section 12.41 ). First, we split the nodes of Q into disjoint subsets according to their 
attack length. Then, we show how to compute a modular strategy for a set of n nodes with the same attack 
length d. Here, we use a decomposition technique which resembles Euclid's ged algorithm. First we check 
whether d divides n. If so, we split the n nodes into pairwise disjoint sets Uq, ..., U d -i so that |C/,j = n/d 
for every 0 < i < d, and define a modular strategy cr such that cr{li) selects uniformly among the elements 
of Ui, where i = \h\ mod d. Observe that val{cr) = d/n, which is optimal by C. If d does not divide n and 
n = k ■ d + c where 1 < c < d, then we split the n nodes into two disjoint subsets U\ and U 2 , where U\ 
contains k ■ d nodes and Ui contains c nodes. A strategy oq for JJ\ is constructed as above, and we need to 
process the set t/ 2 - If c divides d, the strategy cr 2 for (A is a simple loop over the nodes of (A. A closer 
look reveals that an appropriate distribution v = (vi, v 2 ) for combining oq and cr 2 should satisfy the equation 
v\ ■ valuer 1 ) = 1 - Vj which says that the nodes of U\ and (A are defended equally well. If c does not divide 
d and d = j ■ c + t, where 1 < t < c, then the strategy cr 2 for (A spends the first j ■ c steps by performing 
the simple loop over the nodes of t/ 2 , and the next t steps by behaving exactly as the strategy constructed 
for |£/ 2 | nodes with attack length t (which is constructed recursively). Then, cr 2 just keeps repeating its first 
d steps. Again, we can setup an equation that should be satisfied by an appropriated distribution which 
combines cri and cr 2 so that all targets are protected equally well. This procedure eventually produces a 
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modular strategy for defending n nodes with the same attack length cl. If cl divides n, then this strategy is 
provably optimal. In fact, we conjecture that the constructed strategy is always optimal, but we leave this 
hypothesis open (recently, it has been shown by Lamser If22l that the algorithm produces an optimal strategy 
for all odd n and d = 2). Further, let us note that the number of variables/equations in the constructed 
system of polynomial equations is bounded by a polynomial in ||51|, but the size of S is not a good measure 
for identifying hard instances. What really matters is the number of “swaps” in the Euclid’s algorithm 
applied to n and cl] see Section 12.41 for further comments. After processing all subsets of nodes with the 
same attack length, we combine the resulting strategies using an appropriate distribution. The details are 
given in Section [2~4l 

As an example, consider the patrolling problems of Fig. [Q In the first case, we have 3 nodes with the 
same attack length 2. Since 2 does not divide 3, we split the set of nodes into U\ = [uq,u\] and U 2 = {u 2 }. 
The strategy cr 1 for U 1 selects the node ii\ or uq with probability 1, depending on whether the length of the 
history is odd or even, respectively. Note that val{cr\) = 1. For the set U 2 , we have that |F/ 2 I divides 2, and 
so the strategy cr 2 is a self-loop on u 2 . The appropriate distribution v = (vi, v 2 ) for combining oq and cr 2 
should satisfy the equation v\ = 1 - v 2 . Thus, we obtain that v = k = (V5 - l)/2, which yields the strategy 
of Fig. Q] (left). The strategy of Fig. [Tj (right) is obtained by first splitting the set of nodes into U\ = ho, 1 1 i 
and U 2 = {vo, vi, V 2 } according to their attack length, solving these subproblems (note that the solution for 
Uj is a strategy which loops over the vertices of £/,), and then combining them with v = (0.5,0.5). 

Comments on E. We show that for every patrolling problem Q = (U,T, u, E, d) with T = U and a well formed 
signature S , there exists a characteristic digraph Ms depending only on S and computable in polynomial 
time, such that E is sufficiently connected if, and only if, (U, E) contains a subdigraph isomorphic (respecting 
the attack lengths) to Ms . From this we immediately obtain that the problem whether a given E is sufficiently 
connected is in NP, and we also provide the matching lower bound. Note that the characteristic digraph can 
be used to synthesize a minimal sufficiently connected environment for solving a given patrolling problem. 

Related work. Two player zero-sum stochastic games with both perfect and imperfect information have 
been studied very intensively in recent years (see, e.g., cm S3 si), also for games with infinite state-space 
llT0l lT6iS5ll2l. Patrolling games have so far been considered mainly in the context of operation research. 
Flere, the emphasis is usually put on finding methods allowing to synthesize a sufficiently good defender’s 
strategy, and the basic theoretical questions related to the underlying formal model are usually not studied 
in greater detail. The problem of finding locally optimal strategies for robotic patrolling units have been 
studied either in restricted environments (e.g., on circles in EHH), or fully-connected environments with 
weighted preference on the targets fl6][7]]. Some novel aspects of the problem, such as variants with moving 
targets (9j [771 , multiple patrolling units |Sj, or movement of the attacker on the graph Q and reaction to 
alarms 1231 have also been considered in recent works. 

2 The results 

We assume familiarity with the notions introduced earlier in Section Q] 

2.1 The existence of an optimal defender’s strategy 

We start by proving that there exists an optimal strategy for the defender. This is a generalization of similar 
results recently achieved in |[Q for a special type of patrolling games where all nodes share the same attack 
length (i.e., supp{S) is a singleton). The proof technique is completely different. 

Theorem 2.1. For every patrolling problem Q = ( U, T, u, E, d), there exists an optimal defender’s strategy. 
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Proof Sketch. We construct an optimal strategy cr* as a point-wise limit of a sequence cr 1 , cr 2 ,... of strate¬ 
gies where each cr k is 1/k-optimal. More precisely, we select cr 1 , cr 2 ,... in such a way that for each history 
h, the sequence of distributions cr 1 (/?), cr 2 (/i),... converges to a probability distribution, and we define cr*(h ) 
to be its limit (we obtain cr 1 ,cr 2 , ... by starting with an arbitrary sequence of 1/^-optimal strategies and suc¬ 
cessively filtering subsequences that are convergent on individual histories). It is relatively straightforward 
to show that if valuer*) < val - 6 for some 5 > 0, then for all k’s large enough we have val(cr k ) < val - 6/ 2, 
which contradicts the fact that each cr k is 1/k-optimal. For details see Appendix [B] □ 

2.2 Computing finite-memory e-optimal strategies 

In this subsection we describe a generic algorithm which for a given patrolling problem computes a finite 
representation of an e-optimal strategy. Let us start with the definition of a finite-memory strategy. 

Definition 2.2. A finite-memory defender’s strategy is a tuple (M, N, «?o, $) where M is a finite set of memory 
elements, N : MX U —> M assigns to every memory element m £ M and a node u £ U a next memory element 
N(m,u), mo is an initial memory element, and f : M X U —» A (U) is a function which to every memory 
element m £ M and a node u £ U assigns a distribution f(m, u) on U such that supp(f(m, u )) c succ(u). 

A finite-memory defender’s strategy induces a defender’s strategy cr as follows: We extend 

N to an "empty" history s by N(mo, s) = mo, and to all histories hv £ 77, here v £ U, inductively by 
N(mo, hv) = N(N(mo, h), v). Then for hu £ 77 (where u £ U)we have that cr(lm) = g(N(mo, h), u). 

Theorem 2.3. Let e > 0 and assume that u £ T. There is an e-optimal finite-memory defender’s strategy 
computable in time 

3 - |tm 0(W) 


We construct our strategy using the so-called characteristics (some intuition is given below). 

Definition 2.4. A characteristic c is a triple (r, s, c) where r £ U, s is a probability distribution on U, and 
c : {2,..., d] x T —> [0,1]. Denote by Char the set of all characteristics. Given c = (r, s, c) £ Char, we 
denote by val(c) the value min „ e 7 c (d(u), u) ofc. 

Given a characteristic c, we use c r , c s , c c to denote the three components ofc = (r, s, c), respectively. 
Intuitively, we interpret a given characteristic c as a "local" plan of defence for next d steps where 

• c r is the current node, 

• c s is the current assignment of probabilities to the successors of c r , and 

• for every 2 < k < d and every u e T, we interpret c c (k, u) as the probability of visiting u in at least one, 
and at most k steps from c r .u 

To simplify our notation, we denote by c c (l, u) the probability c s (u) for every u £ T. 

Now assume that the current plan is formalized by a characteristic c, and suppose that the defender 
makes one step to a next vertex v chosen randomly with probability c s (v). Now the defender declares a 
new plan, c v £ Char where c' r = v. However, the crucial observation is that the new plans (c v ) ve u must be 
consistent with the original plan c in the following sense for all 2 < k < d and all u £ T : 

c c (k, u) = c s (u) + ^ c s (v) • c x c (k - 1, u) 

v+u 

'Note that many characteristics are not “consistent” (if e.g. c s (u) = 1/2 and c c (1,m) = 1/4). But later we make sure that only 
consistent characteristics are used. 
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We say that such a vector ( c v ) ve u e Char* 7 of characteristics is a successor of c. 

Now let C be a finite set of characteristics such that every c e C has a successor ( c v ) ve u e C u (i.e., 
c v € C for all v e U), and there is at least one c e C such that c r = u. We say that such C is closed. 
We construct a finite-memory strategy (M,N,mo,f) where M = C, N{c,v ) = c v , mo = c, and £(c) = c s . 
Intuitively, the strategy follows the plans in C and always proceeds to the next plan according to a fixed 
successor in C u . We prove that this strategy works consistently with the characteristics of C, i.e., whenever 
the current history is li and the current memory element is c, then, subsequently, the probability of reaching 
u in at least one, and at most k steps is equal to c s (k, u). Thus the value of the finite-memory strategy cannot 
be worse than min ce c val{c). 

So, the computation of a finite-memory strategy reduces to a computation of a finite closed set of char¬ 
acteristics. We show that one such set can be extracted from a carefully selected e-optimal strategy. Given 
a defender’s strategy cr, we denote by Filer) the set of all histories that cr may follow with a positive prob¬ 
ability. Given a strategy cr and a history h e 'Id Ur), we define a characteristic c[cr,h\ such that c[h] r is 
the last node of h, c[/j] s = <r(h), and each c[h] c (k, u) is the probability of reaching u in at least one, and at 
most k steps starting with the history h using cr. Now let cr* be an optimal strategy. The crucial observation 
(see also Proposition 1C. II in Appendix P is that for every li e Idler) it holds that val{c\cr* ,li\) > val. By 
appropriately rounding probabilities in cr*, we obtain an e-optimal strategy cr E such that for every history h 
and every u e U : 

cr s (h)(ii) = k ■ \d ■ |G|/el _1 for a suitable k 6 N 


and c[cr e , li\ > val - s for all h € 'Idler 1 ) ■ 

Now it is rather straightforward to show that for each h, the vector (c\hv\) v€ u is a successor of c\h\. 
Thus the set Char[cr £ ] of all c\h\, here h e ‘Id, is a closed set. It is also finite, of size that is bounded by 

/ - \0{d 2 -\U\) 

yd ■ |(/|/e) , and every c e Char[cr ] satisfies val(c) > val - e. This shows that there always exists a 

/ - \0(cP--\U\) 

e-optimal finite-memory strategy of the size bounded by yd ■ |(/|/el 

Our algorithm computes a closed subset C of a (finite) set of appropriately rounded characteristics that 
maximizes min c€ c val{c). This is done by a simple iterative procedure which maintains a growing pool of 
characteristics (in order of decreasing value) and tries to find its closed subset. For details see Appendix 0 


2.3 A bound on the Stackelberg value 

Now we establish an upper bound on val which depends only on the attack signature S of Q. The simplicity 
of the argument is due to Proposition ID. II 

Theorem 2.5. For every patrolling problem Q = (JJ,T,u,E,d) such that T = U, we have that 
val < (Yikesupp(S) Hf) where S is the attack signature ofQ. 

Proof Sketch. Intuitivelly, every node u has to be visited by the defender with propability at least val during 
each d{u) consecutive steps. Hence, summing the probabilities of visiting u in each of the steps from 1 to 
l = n kesupp(S)k we need to reach a value greater than or equal to val ■ f:/d(u). Summing these values for 
all nodes we have at least f„ Cz u val ■ i/diu). Note that in each step we visit some node with probability one 
and so, the sum for all nodes and £ steps is just £. This implies the theorem due to t > Yaieu va ^ ' £/d(n) = 
£ ■ val ■ fk€.uipp(S) S (k)/k. For more details see Appendix iDl □ 






2.4 Solving patrolling problems with a fully connected environment 

Let Q ~ (U, T,u,E,d) be a patrolling problem where T = U and E = U x U , and let S be the signature 
of Q. Recall the notion of modular strategy and the associated decomposition principle introduced in Sec¬ 
tion [0 In particular, recall that a (/-modular strategy c r for Q is fully represented by probability distributions 
po,... ,Hd-\ over U such that cr(/i) = /q where i = \h\ mod d. 

We start by considering the case when Q has n nodes with the same attack length d. Since we aim at 
developing a strategy synthesis algorithm polynomial in ||S||, we need to invent a compact representation 
of modular strategies which is sufficiently expressive for our purposes. We assume that the nodes of U are 
indexed by numbers from 1 to |I/|, and we use U(i,N ) to denote the subset of U consisting of N subsequent 
nodes starting from i, i.e., all u/ where i < { < i + N and 1 < i < i + N - 1 < |I/|. Let us consider the class 
of expressions determined by the following abstract syntax equation: 

6 ::= Circle(U(i,N),M,L ) | Bp, fa I , 6 2 \ 

Here, M, L e N such that M divides N, and p ranges over a countable set of variables Var. Assuming some 
valuation a : Var —> [0,1], every expression 6 determines a modular strategy for U defined inductively 
as follows: Circle(U(i,N),M,L) is a modular strategy which splits U(i,N ) into pairwise disjoint subsets 
of size M and then “walks around” these sets L times, f )\; 62 is a modular strategy which “sequentially 
alternates” between 8 \ and 6b- and v p \9 \, 82 ] is a strategy which “composes” 6 \ and 62 using the distribution 
(1 - a(p), a{p)). A detailed description of the semantics is given in Appendix[E] 

Our strategy synthesis algorithm is a recursive procedure Defend which inputs a triple ( U(i,N),D,e ), 
where U(i,N ) is the set of nodes to be defended, D is the number of steps available for defending U(i,N), 
and e is an expression which represents the “weight” of the constructed defending strategy in the final 
distribution v. The procedure outputs a pair (0 , V) where 6 is an expression specifying a D-modular strategy 
for U(i,N), and V is an arithmetic expression representing the guaranteed “coverage” of the targets in 
U(i,N ) when using 8 with the weight e . As a side effect, the function Defend may produce equations for 
the variables that are employed in symbolic strategy compositions of the form v p [ 8 \, 82 \. The algorithm is 
invoked by Defend/ (7(1, |(7|), d , 1), and the system of equations is initially empty. The recursion is stopped 
when D divides N or N divides D, and in these cases Defend provably produces strategies that achieve the 
best coverage for every value of e . In the other cases, Defend proceeds recursively by splitting either the 
set of nodes or the number of steps available to protect the nodes. In both cases, Defend tries to exploit the 
available resources in the best possible way. A full description is given in Appendix |E] At the very end, 
we obtain a d-modular strategy cr for Q specified by an expression 8 whose size is polynomial in ||S||, an 
expression V which represents val{c r), and we also obtain a system of polynomial equations for the variables 
which parameterize 8 and V. The system has a unique solution in [0, I ] k (where k is the number of variables) 
that corresponds to the intended valuation. The size of k can be, for given n > d , computed as follows: we put 
no = 11 and do = d, and then n- t+ \ = n, mod d,- and d , + 1 = d, mod n l+ \. The number of variables for n and d is 
equal to the least index j such that dj divides nj . In particular, if d divides /;. there is no variable at all, and our 
algorithm immediately produces a strategy which achieves the value d/n, which is optimal by Theorem 12.51 
As an example of a “hard” instance, consider n = 709793170386861531 and d = 37248973638339152, 
which requires 30 variables and equations. The solution (producing val(cr) = 0.05247471678) can be 
computed by Maple in fractions of a second. It has been recently proved by Lamser ll22j that our algorithm 
produces na optimal strategy also when d = 2 (for arbitrary n), which includes the example of Fig. Q] (left). 
Since the algorithm seems to exploit the available resources optimally, we conjecture that it actually outputs 
an optimal strategy for all parameters. 

To solve a patrolling problem with a general signature S , we simply split the nodes into disjoint subsets 
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according to their attack lengths, solve these subproblems by the above algorithm, and then compose the 
modular strategies so that all nodes are defended equally well. One can easily check that if S is well formed, 
this leads to a strategy whose value matches the bound of Theorem 12. 5 1 Thus, we obtain the following: 

Theorem 2.6. Let Q be a patrolling problem with T — U, a fully connected environment, and a well formed 
signature S. Then there is an optimal modular strategy cr computable in time polynomial in ||S||. 

2.5 A characterization of sufficiently connected environments 

For the rest of this subsection, we fix a patrolling problem Q = {U,T,u, E, d) with T = U and a well-formed 
signature S. We classify the conditions under which E is sufficiently connected (recall that E is sufficiently 
connected iff the value for Q is the same as the value for Q when E is replaced with the fully connected 
environment U x U. Let Ms be a digraph with vertex labelling d constructed as follows: 

• For all k e supp(S), i £ {0,..., k-l], and j e {1,... ,S(k)/k}, we add a fresh vertex vfii, j\ and set 
d(vk[i,j I) := k. Flence, M s has exactly ^kesupp(S) S (&) vertices. 

• For every pah - of vertices vy [ i, j] and vt [/', /], there is an arc from vy [/, j] to vp [/', /] in Ms iff there is 
some 0 <£ <k ■ k' such that i = C mod k and i' - ( £+\)mod k'. 

Note that Ms is computable in polynomial time. We prove the following: 

Theorem 2.7. Let Q = (U,T,u, E,d) be a patrolling problem such that T = U and the signature S of Q 
is well formed. Then E is sufficiently connected iff (JJ,E ) contains a subdigraph H which is d-preserving 
isomorphic to M s (i.e., if x of H is mapped to y of Ms then d(x) = d(y)). 

The “if” part of Theorem 12.71 is trivial, because if ( U,E ) contains a subdigraph M s , then we can implement 
the optimal modular strategy constructed by the algorithm of Subsection 12.41 The “only if” part is more 
challenging. The crucial observation is that the defender is not allowed to visit any target u twice within 
d{u) steps whenever she is aiming to reach the bound of Theorem 12.51 The underlying observations also 
reveals that every optimal strategy cr starts to behave like the strategy cr* after every history which visits 
all nodes. Flence, the strategy cr* does not belong to IJylj L 7 , except for some trivial cases (see Section Q3. 
A proof of Theorem 12.71 is given in Appendix [FJ An immediate consequence of Theorem 12.71 is that the 
problem whether a environment E is sufficiently connected is in NP. We complement this by a matching 
lower bound in the following theorem with a full proof in Appendix [Gj 

Theorem 2.8. The problem whether the environment of a given patrolling problem Q = (U,T, u, E, d)), such 
that T = U and the signature S of Q is well formed, is sufficiently connected, is NP-complete. Further, this 
problem is NP-complete even for a subclass of patrolling problems such that supp(S) = {k }, where k > 3 
is a fixed constant. For a subclass of patrolling problems where supp(S) = {2}, the problem is solvable in 
polynomial time. 

3 Open problems 

Our proof of the existence of an optimal defender’s strategy (Theorem 12. II) does not allow to conclude 
anything about the structure of optimal strategies. One is tempted to expect that optimal strategies are in 
some sense “regular” and require only finite-memory, but our present understanding does not allow to prove 
this conjecture. Another challenge it to lift the presented compositional technique to a more general class 
of patrolling games (such results would have a considerable practical impact). Finally, the question whether 
the algorithm of Section \2A\ produces an optimal strategy for all inputs is also interesting but left open. 
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A Detailed definitions for appendices 


We use N and No to denote the sets of positive and non-negative integers, respectively. The sets of all finite 
and infinite words over a given alphabet T are denoted by P and P'\ respectively. We write e for the empty 
word. The length of a given tv e T* U P is denoted by |w|, where the length of an infinite word is oo. We 
denote by V £k the set of all words vv e P satisfying |w| < k. The last letter of a finite non-empty word vv 
is denoted by last(w). Given a (finite or infinite) word w over T, the individual letters of w are denoted by 
wo vv i • • •. Given two words w, w' e P U V°‘ we write vv < tv' whenever w is a prefix of w', i.e., whenever 
there exists a word vv" e V* U V‘ !> such that w' = ww". Further, we write vv < vv' whenever vv < vv' and 
vv + vv'. 

Given a finite or countably infinite set A, a probability distribution over A is a function 6 : A —> [0,1] 
such that Yiaesupp(S) <5(«) = 1. The support of 6 is the set supp(5) = {« e A | (5(a) + 0). We use A (A) to denote 
the set of all distributions over A. A distribution 6 e A (A) is positive if 6(a) > 0 for every a £ A, and rational 
if 6(a) is rational for every a e A. 

Definition A.l. A patrolling problem is a triple Q = (U, T, u, E, d) where U is a finite set of nodes, T c U 
is a set of targets, u e T is the initial target, E c U x U is an environment, and d : T —> N assigns to each 
target the associated attack length. The attack signature of Q is a function S : N —> No where S(k) is the 
cardinality of{u e U \ d(u ) = k). We use supp(S) to denote the set [k e N | S ( k) T 0}. We say that S is well 
formed ifk divides S (k)for every k <S N. By d we denote ma x ue u{d(u)}. 

Let Q = (U, T, u, E, d) be a patrolling problem. We say that E is fully connected if E = U x U. Given a node 
u e U, we denote by succ(u ) the set {»' e U \ (u, u') e E } of all successors of u. A path is a finite or infinite 
word w € U* U U“' such that (w,-, w,-+i) e E for every 0 < i < \w\. A history is a finite non-empty path, and a 
run is an infinite path. The sets of all histories and runs are denoted by El and R, respectively. Given a set 
of histories H c El, we use R(H) to denote the set of all runs co such that w < co for some vv € El (when 
El = {/?}, we write R(h) instead of R(\h\)). 

Definition A.2. A defender’s strategy is a function cr : El —» MU) such that supp(cr(h)) c succ(last(h)) for 
every h e El. The set of all defender’s strategies is denoted by I. 

An attacker’s strategy is a function n : r H —» TU(1| such that whenever n(h ) + ±, then for all h' < li 
we have that n(h') = J_. We denote by II the set of all attacker’s strategies. 

Intuitively, given a history h, the defender chooses the next node randomly according to the distribution 
cr(h), and the attacker either attacks a node u e T (n(h) = u), or waits (n(li) - _L). Note that the attacker 
can choose to attack only once during a play, and also note that he cannot randomize. This is because 
randomization does not help the attacker to decrease the Stackelberg value, and hence we can safely adopt 
this restriction from the very beginning. 

For a given strategy cr e E, we define the set El(cr) c 'H of relevant histories, consisting of all h e El 
such that for all IT e El and u e U where ITu < h we have that cr(lT)(u) > 0. Note that a defender’s strategy 
cr determines a unique probability space over all infinite paths initiated in a given u e U in the standard way 
(see, e.g., fT2l0 . and we use to denote the associated probability measure. 

Given an attacker’s strategy n, we say that a run w contains a successfid attack if there exist a finite 
prefix h of vv and a node u € T such that n(li) = u and u is not among the first d nodes visited by w after the 
prefix h. For every node u e U, we use l)„[7r] to denote the set of all defended runs initiated in u that do not 
contain a successful attack. Hence, l 'P l [ l CD u \n\) is the probability of all runs initiated in u that are defended 
when the defender uses the strategy cr and the attacker uses the strategy n. We omit the subscript u in 
and [zr] when u = u. 
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Definition A.3. For all u e U and cr e Z, we denote by val u (a ) z/z<? value of cr defined by val u {cr) — 
inf ffe n The Stackelberg value ofu is defined as val u = sup (r(El val fa). A defender’s strategy cr* 

is optimal in u if valfia*) = val u . The value ofu is denoted by val, and a strategy which is optimal in u is 
called just optimal. 

At some places, we consider strategies obtained by “forgetting” some initial prefix of the history. For¬ 
mally, for all h e 'H and a strategy 6 of the defender/attacker, we define a strategy 0/, by GfulT) = GlhIT) for 
every u e U and IT e FI. Note that cr/, behaves similarly for all initial nodes. We are typically interested in 
its behavior starting in lastfh), which corresponds to behavior of cr when started at /?. 

In what follows, we also use the notion of an immediate attack value. Given a defender’s strategy cr, a 
history h € Filer), and a node u e U, we define att-valfa, u) to be the probability of reaching u from lastfh) 
in at least one and at most dfu) steps using the strategy cr/,. Intuitively, att-valh{cr,u) is the probability of 
defending u assuming that the attack on u starts after the history /;, i.e., n(fi) = u. It is easy to see that 

P IT (D\n\) = ^ r P' r (h) ■ alt-val far, n(h)) 

he'J-t (cr) 

7T(h)J=± 

B The existence of an optimal defender’s strategy 


Theorem 12.11 For every patrolling problem Q = (U, T, u, E, d) there exists an optimal defender’s strategy. 

Proof. We construct an optimal strategy cr* as a point-wise limit of a sequence cr 1 , cr 2 ,... of strategies where 
each cr k is 1 //.--optimal. More precisely, we prove the following. 

Claim : There is a sequence of defender’s strategies cr 1 , cr 2 ,... and a defender’s strategy cr* such that 

• each cr 1 is 1 //-optimal, i.e., valla 1 ) > val - 1/z, 

• for every h e Flfa) and every u e U we have that lini/^co a\h)lu) = a*(h)(u). (In particular, the limit 
exists for every li and u.) 

Proof: Assume a lexicographical ordering < on histories of FI. To simplify our notation, we consider an 
"empty" history e such that e < li for every h e FI. We consider histories h successively according to < and 
inductively define sequences a hil ,a h ’ 2 , ... of defender’s strategies so that the following holds: 

A. each a h ’’ is 1 //-optimal, 

B. a h,l ,a lu2 , ... is a subsequence of all preceding sequences a h ,1 ,a h ,2 ,... for IT < h, 

C. for every IT < h the sequence of distributions a h,l lh!), a h ' 2 lh'),... converges (point-wisely) to a proba¬ 
bility distribution. 

Then it suffices to put a 1 = a h ,|/ ' 1 where h is the z'-th history according to <, and to define cr*(/z) = 
lim^co afh). 

We define cr /M as follows: 

• For every z e N, we define cr fV to be an arbitrary 1 /z-optimal strategy. 

• Assume that <r h , <r h - 2 ,... has already been defined for IT. Consider a next history h according to <. 
As the space of all probability distributions on U is compact, there exists a subsequence a h,l ,a h ' 2 , ... of 
a 1 1' 1 , a h ’ 2 ,... such that a h,l lh) converges (point-wisely) to a probability distribution on U. 

The sequences apparently satisfy the above conditions A, B, C. ■ 

We prove that the defender’s strategy cr* obtained in the above Claim is optimal. Suppose that cr* is not 
optimal, i.e. valla*) < val - 5 for some 6 > 0. Then there is an attacker’s strategy n such that F ,t (D\t:\) < 
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val -6/2. For every i € N, let 717 behave as n on runs where n attacks before /-th step, and do not attack at 
all on the rest. 

Claim : lim MOO ^ ,r *(£>[ 7 T, ]) = P‘ T '(TJ\n\) 

Proof: Note that 

P ,J (D\n\) = ^ (h) ■ att-valh(cr*, n(h)) 

he'H(a- t ) 

n(h)i=± 

- ^ (h) ■ att-valf,(<r*,7r(h)) + ^ P a (h) ■ att-valhlcr*,n(h)) 

/iG*K(cr*) he*H(o ■*) 

7r(h)^± 7r(h)^± 

\h\<i \h\>i 

= P lT \D[n]) + Pi 

where p ,• is the probability that the the attacker starts his attack after i. Clearly, p,- —> 00 as i —» 00 , which 
proves the claim. ■ 

Thus for a sufficiently large i we have that P ,T (D[i r,]) < val - 6/4. Now observe that for all sufficiently 
large leNwe have \P ,T (T)\jTi ]) - P ,T {T)\ni\)\ < 6 /8 because the transition probabilities determined by cr* 
and cr k on the first i + d steps are getting closer and closer with growing k. Flowever, then we obtain that 
P ,T CD\ni ])| < val - 6/ 8 , which means that cr k cannot be I /^-optimal for large k. □ 

Proposition B.2. Assume that u is a target. There every optimal defender’s strategy cr* satisfies 

inf min att-val/fa* ,u) > val ( 1 ) 

h&H(&*) ueT 

Proof. Recall that we denote by val u and val u (cr ) the values of Q and of cr, resp., when u is used as the initial 
node instead of u. It suffices to prove Proposition IB.21 under the assumption that val = vain = max,,(= 7 - val u , 
because then we obtain, as a consequence, that val u = vain f° r a ll 11 *= T. Indeed, using cr*, every target 
node has to be visited. So given u 6 T, there is a history h e Tiler*) such that u = last(h). Flowever, note 
that © holds also for cr* instead of cr* , and thus val u {cr* h ) > val. As vain = val is maximal, we obtain that 
val u = vain. 

So assume that val = vain = max„ g 7 - val u . Let cr* be an optimal strategy. Note that val = ma \ lie jval u 
implies vali ast (hfcr* h ) < val for every history h e 'H(cr) such that last(h) e T. We obtain that vali ast (h)(crh ) ^ 
val for every history h e Ti{cr) because even if last{li) is not a target, cr/, starting in last{li) must visit a target 
almost surely and the attacker may wait until it happens. 

We claim that cr* satisfies ©, i.e. that att-vaffa*,u) > val for all h e Tiler*) and all u e T. Indeed, 
assume that att-vaffer*, u) < val - 6 for some 6 > 0 and h e Tiler*) and u e U. Assume, w.l.o.g., that cr* 
follows the history h with probability at least 6. 

Note that due to va// as ,(/,)(cr*) < val for every h. the deficiency of cr* at h cannot be compensated on 
other histories. We obtain the following: Let A be the set of all histories h’ of length |/;| (i.e., in particular, 
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h e A). Then 


val(cr*) 


< 


< 


< 


< 


Y P”* (V) ■ vali astW) {o-* h ,) 
h'eA 

P <T (h) ■ vahast(h){cr* h ) + Y ^ ^ ' val last(h’){cr* h ,) 
h'eA\{h 1 


P <T (h) ■ vali ast(h) (cr* h ) + Y ^ ^ ' val 

h'eA\{h\ 

P (T (h) ■ min att-valh(cr*,u) + V* P‘ T Hi) ■ val 

ueU 

h'eA\[h | 

P ,T (h) ■ (val - 6) + Y < P <T *(h ')' va -l 

h'eA\{h | 


val - P a (h ) • 5 
val - 8 2 


This contradicts the fact that <r* is optimal. 


□ 


C Computing finite-memory e-optimal strategies 


C. 1 Proposition IC.ll 

Let us fix a patrolling problem Q = (U, T, u, E, d). 


Proposition C.l. Given s > 0, there is an e-optimal strategy o~ E such that for every history h and every 
u e U it holds 


cr s {h){u) = k ■ \(\U\d)ls\ 1 for a suitable k £ N 


and 


min att-valifcr 8 , u) > val - s. 
ueU 

Proof Let cr be an optimal strategy. Let U = {u\, ui, ■ ■ ■, u\u\) be the set of nodes of Q and define s - 

mw/sr 1 . 

For every history h and every 1 < i < |t/|, we inductively define (r r '(h)(uj) = k t • s, where kj is the largest 
number satisfying 


(-1 

kj - s < cr(h)(ui) + Y^Miuj) ~ o- s (h)(uj )) . 

7=1 

This rounding procedure guarantees that cr E (h ) is indeed a probability distribution over U, i.e. 
Tiueu o~ K (h)(u) = 1 (note that simple rounding would not guarantee this property). Further, when we re¬ 
alize the invariant 0 < X'/Ji (<r(h)(uj) - cr s (li)(uj)) < s holds for all 1 < i < |t/|, it is easy to see that 
\cr{h){ui) - cr E {li){ui)\ < s, which is captured by the following claim. 

Claim A: | cr(h)(u) - cr E {h){u)\ < s for every u e U. 

It follows from the definition of cr £ that whenever cr{h){u) = 0, then also cr E {h){u) — 0. This means that 
any history executable using cr E is also executable using cr, i.e. Eller 8 ) c Eller). 
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Now, knowing that att-valh(cr) is defined if att-valh{cr E ) is defined, we prove the following: 


att-valh(cr e ) > att-valhicr ) - e ( 2 ) 

> valiast(h)(crh) - £ (3) 

> vfl/fcr) - £ . (4) 

• The inequality I© directly follows from Proposition IB. 21 as h e 74 (<x). 

• The inequality ([3]) clearly holds as forcing the attacker to attack immediately cannot decrease the value 
of the game. 

• To prove the first inequality d2]), we have to analyze the impact of the rounding in the definition of cr £ . 

Denote by R[l, li, t, k ] the probability of reaching t e T from last{h), h e 74, in up to k steps using the 
strategy i h . 

We prove by induction on k that for all h e 74. t e T, and k e N we have that 


f?[cr, h, t, k] - R[cr E , li, t, k] < k\U\s . 

The base case (k = 1) directly follows from Claim A for all u € U and the fact that R[l, li, t, 1] = i{h){t) 
for every defender’s strategy l. 

Let us denote the difference R[cr, h, t, k] - R[cr E , h, t, k] by A. For k > 2, we have 

A = cr(h)(t) + ^ <r(h)(u) ■ R[cr, hu, t,k - 1] - (5) 

- (r s (h)(t) ~ z c t e {1i){u ) • R[cr e , hu, t,k - 1] 

ueU\{t) 

< 5 + ^ (R[<r,hu,t,k - l\ ■ (cr(h)(u) - (r E (h)(u)) + ( 6 ) 

U€U\[t) 

+ cr E (li)(u) ■ ( 7 ?[cr, hu, t,k— 1] - R[cr E , hu, t,k— 1])) 

< s + ^ f?[cr, hu, t, k - 1] • s + (7) 

wet/\{?} 

+ ^ a e (h)(u)-{k-\)\U\s 

ueU\{t } 

< s + (\U\ - l)s + (k - l)\U\s ( 8 ) 

= k\U\s . 

- The equality © follows from the definition of R\l,Ii, t,k\ as R[i,h,t,k\ = i{h){t) + Zoef/\{/| Rh)(u) ■ 
R[l, hu, t,k - 1] for all k >2. 

- The equality ([6]) is just an application of Claim A and of the formula ab - a'b' = b{a - a') + a'{b - b'). 

- The inequality © follows from Claim A and from the induction hypothesis. 

- The inequality © holds because R[<r,hu,t,k - 1] < 1 and ZueiNM o- E {li){u) < 1. 

So we have that f?[cr, h, t, d(t )] - R\(r ,: . h, t, d(t )] < c/|(/|.s. Flowever, note that 


att-vali, (r) = inf R[l, h, t, d(t)] 
teT 
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and therefore 


att-valh(cr ) - att-vaffcr 8 ) < d\U\s < e . 


□ 


C.2 Formal proof of Theorem 12.31 

In order to make lengthy computations more succinct, we use the following shorthand notation: Given 
a characteristic c = (r, s, c) e Char, we define: 

• c(0, u) = 1 if u = c r , and c(0, u) = 0 for all u + c r . 

• c(l, u ) = c s (u ) for all u e U. 

• c(k, u) = c c (k, u) for all 2 < k < d and u e T. 

Also, we use functional notation to denote vectors of characteristics (i.e. successors). That is we represent 
each (c v ) v€ [/ e Char r as a function f : U —» Char where g(y) = c v for every v e U. 

Let us formally define the notion of successor of a characteristic. We say that £ : U —» Char is 
a successor of c e Char if for every v e U holds £(v)(0, v) = 1, and for every u e T and 2 < k < d holds 

c(k, u) = c(l, u) + ^ c(l, v) • £(v)(k - 1, u) 

V^U 

A set of characteristics B c C is closed if there is at least one c e B satisfying c(0, u) = 1, and every c e B 
has a successor £ : U —» B. 

Given a defender’s strategy cr and a history h, we denote by c[cr, h] the characteristic defined as follows: 
c[cr,h\{last{h)) = 1, and c[cr, /?](1, u) = cr{h){u) for every u € U, and for every 2 < k < d and u e T we 
define 

c[cr, h](k, u) = P ,T ('R({hh' \ last(h') = u, 1 < \h'\ < k}) \ ' R(h )) 

(Intuitively, for k > 1, the value c[cr, /j](k, u) is the probability of reaching u in at least one and at most k 
steps starting with the history h and using cr.) Denote by Charfcr] the set of all characteristics c[cr, /;] where 
h € H-licr). 

Lemma C.2. Given a defender’s strategy cr, the set Char[cr] is closed. 

Proof. By definition, c[cr,u](0,u) = 1. Now consider c\<r, h ] e Char [cr]. Let £(v) = c\<r, hv\. Apparently, 
^(v) € Charfcr] so it suffices to show that f is a successor of c[cr, h\. By definition, 

^(v)(0, v) = c[cr,/n’](0, v) = 1 


and, clearly, 

c[cr, h\(k, it) ~ c[cr, h\( 1, u) + ^ c[cr, h\( 1, v) • c[cr, hv\(k - 1, Li) 

V^U 

= c[cr,h](l,u) + ^c[cr,h](l,v) ■ %(v)(k - 1 ,u) 

V^U 

which means that f is a successor of c[cr, It ]. □ 


18 




Let C be a finite closed subset of Char. We say that a finite-memory strategy cr = ( M, N, mo, £) is consistent 
with C if 

• M = C, 

• for every c e C the function K{c) defined by 

K(c)(u) = N(c, u) for all ueU 

is a successor of c. 

• = c for some c e C satisfying c(0, u) = 1, 

• fie, u) - c( 1, u ) for all u € U. 

Proposition C.3. Let C be a finite closed set of characteristics and assume that cr is consistent with C. Then 
val{cr) > min c6 c val(c). 

Proof. Let us first prove that c[cr,h] e C for every history h e 'H(cr). Let us fix a history h. We prove 
that c[cr, /?] = N{c,h), i.e. that c[cr,h](k,ii) = N{c,h){k,u) for all 0 < k < d. It is easy to show that 
N(c, /j)(0, last{h)) = 1. For k > 0 we proceed by induction on k. Immediately from definitions we obtain 
that for every u e U 

c[c, h](l,u) - cr(h)(u) = ^(A^(c, h ), u ) = N(c, h)(\,u) 

Now consider 2 < k < d. We have 

c[cr, h\(k, u ) = c[cr, h]( 1, v) + ^ c[cr, h]( 1, v) • c[cr, hv\(k - 1 ,u) 

Vj=U 

= N(c, h)(l,u) + N(c, h)( 1, v) • N(c, hv)(k - L u) 

V=U 

= N(c, h)(l,u) + N(c, h)( 1, v) • N(N(c, h), v’)(k - 1, n) 

V—U 

= /V(c, /j)(1, u) + Y Me, /?)(!, v) • K(N(c, h))(v')(k - 1, u) 

V=U 

= N(c, h)(k, u) 


Here the second equality follows by induction, the last equality follows from the fact that K(N{c, h)) is a 
successor of N{c, h). This proves that c[cr, h ] e C for every history h e Tiler). 

Now since every defender’s strategy cr satisfies 

att-valifcr, u) = c[cr, h](d(u), u) 


we obtain 


val{cr) = 


> 


inf P?(D[7t]) 

71 

inf Yj T’i'RUi)) ■ att-valh(cr, n{h)) 

he r H{cr),n(h)e.U 

inf Yj P(R(h)) ■ c[ cr, h \(d(ji(h)), jc(h)) 

71 he < H(cr).jT(h)eU 

inf V PPR(h)) ■ min val(c) 

n ceC 

he'HWrthteU 


min vfl/(c) 

cgC 


□ 
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Let Char e be the set of all characteristics c such that cik, u) is an integer multiple of s k , here s = \\U\d/s\ 1 , 
for every 1 < k < d and every u e U. 

Lemma C.4. The set Char £ contains a (finite) closed subset C such that min ce c val(c ) > val - s. 

Proof. It suffices to consider cr e of Proposition 1C. 11 Then Char[cr £ ] c Char £ is a closed subset. □ 

Given any closed subset C of Char £ satisfying min ce c val(c) > val - s, we obtain, via Proposition IC.31 a 
finite-memory £-optimal strategy. So it remains to give an algorithm for computing such a closed subset C. 

The Algorithm 

The following procedure computes a closed subset C of Char e which maximizes min ce c val{c ) among all 
closed subsets of Char £ (so in particular, satisfies the desired bound min ce c val(c) > val - s). 

Let ci,..., c„ be all characteristics of Char £ ordered in such a way that for arbitrary c,-, cj we have that 
val{c{) < val{cj) implies i < j. The following procedure maintains the invariant that A = {ci,..., ty) for 
some k > 0 and computes the desired closed set C : 

1. Initialize A := {ci}. 

2. Compute a closed subset of A, or indicate that A does not contain a closed subset as follows: 

a. Initialize B := A, 

b. compute B' as the set of all c e B that have successors in B. 

c. depending on B' do: 

* if either B' = 0, or there is no c e B' such that c(0, u) = 1, then indicate that there is no closed 
subset of A (and proceed to 3.), 

* else, if B = B\ then return fiasa closed subset of A (and proceed to 3.), 

* else assign B := B' and go to b. 

3. If A = {ci,..., cu\ does not contain a closed subset, then add c*+i to A and go to 2., else return the closed 
subset as a result. 


Correctness 

In step 2., the algorithm computes the greatest closed subset of A using a straightforward iterative algorithm. 
As the characteristics are added to A in the order of non-decreasing value and there exists a closed subset C of 
Char £ satisfying min ce c val(c) > val - s (due to Lcmnia [C.4l ). a subset C r satisfying min ce c' val(c) > val - e 
is computed when C c A for the first time. 


Complexity 


Let us denote by 0 the size of Char £ . It is straightforward to show that 

computation in step 2. b. takes time in @ 0( IG) (f or every characteristic of B 
successors, i.e. vectors of the form £ : U —» B). The whole algorithm iterates 
3. (a characteristic is added to A in every iteration except the last one). So the 


0 € 



OQU\d 2 ) 


Now the 


one has to check all possible 
at most 0 times through 1. - 
total complexity is at most 


0<AIG) = 
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D A bound on the Stackelberg value 

Using the arguments of the proof of Proposition IB. 21 the following proposition can be shown. 

Proposition D.l. Let Q — (if, T, u, E, d) be a patrolling problem. Further, let cr be an optimal defender’s 
strategy and h e Li(ir). Then vali ast ^fcri 7 ) = val{cr) = val. 

Note that Proposition ID. 1 1 cannot be generalized to non-optimal strategies, i.e., for a given non-optimal cr 
and h e Li {(f) we do not necessarily have that vali ast (h){o~h) = valuer) (a counterexample is easy to find). 

Theorem 12.51 For every patrolling problem Q = {U,T,u,E,d ) such that T = U, we have that val < 
(’LkcsupptS) where S is the attack signature ofQ. 


Proof. Let cr be an optimal defender’s strategy. For all It 6 < H{cr) and i e Mo, let Nodepy : 'R(li) —> U be a 
function which to every run hw e Rih) assigns the node w;. Further, let ppy e A (U) be a distribution defined 
by Phfu) = r T (Node h ^u)ir T mii)). 

First, we show that for all u e U and i e No we have that 1 puj( u ) A val. Let us fix some u e U 

and i e No, and let Lifer) be the set of all h e Li{cr) such that |/;| = i. For every h e Lifer), consider an 
attacker’s strategy n such that n(h) = u. Due to Proposition ID. II we have that vali asl (pf(rp) = val, which 
means that P a lasl{K) (T)i as tW)\^h\) is at least val. Obviously, P^ asm RDi as m\^hY) < Hyio”' Ph.fu)- Thus, we 
obtain 1 Ph, f u ) > val. Now it suffices to realize 


i+d{u)~ 1 

Yj 

j=i 


z 

he < Hi(cr) 


d{u )-1 

r T (%m ■ Y ELM) 


7=0 


> val- Y ^(TO) 
he'Hitcr) 


val. 


Now we can continue with the main proof. Let (' = TVkesupptS) k. Since 2 * Pu,j( u ) - va ^ f° r all 


u € U and i e No (see above), we immediately obtain Zy=o Fu,j( u ) - va ^' dbj- Hence, 


d(u) ’ 


£-\ t-\ 

= YY^ (u) = YY^l ( u) 

j =0 ueU ueU j =0 


Yj Va ^ ' 


ueU 


d{u) 


= val - Y 

kesupp(S ) 


S(k)-€ 


Thus, we get val < 


(2 


S(k) \ 
kesupp(S) k ) 


-1 


as desired. 


□ 


E Solving patrolling problems with a fully connected environment 

Let Q — (U, Tii, E, d) be a patrolling problem where T = U, E = U x U, and let S be the signature of Q. We 
start by defining the semantics for the “strategy expressions” introduced in Section [2~4l precisely. 

• Circle(U(i, N), M, L) denotes the c-modular strategy where c = L ■ ( N/M ) such that the distribution pc, 
where 0 < t < c, selects uniformly among the elements of U(i + i ■ M,M) where € = { mod {N/M). 
In other words, Circle(U(i,N),M,L) is a strategy which splits U(i,N) into pairwise disjoint subsets of 
size M and then “walks around” these sets L times (actually, Circle(U(i, N), M, L) can also be seen as 
(/V/A/j-modular strategy, but for technical reasons we prefer to interpret it as a c-modular strategy). 

• if 6\ and denote ci-modular and (w-modular strategies with underlying distributions p l () ,... and 

Pq, .. - ,p 2 Cl _ v respectively, then Bp, 02 denotes the ci+co-modular strategy with the underlying distribu- 
tions pl,...,p\ i _ v pl,...,pl- v 
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• if 6 \ and 0 2 denote c-modular strategies with underlying distributions p l (} ,... ,p\_ x and p^,... ,p 2 c _ x , then 
Then v p \ 6 \, 0 2 ] denotes the c-modular strategy with the underlying distributions /Jo,... ,p c -\, where p t = 
(1 - a(p)) ■ p] + a(p ) • pj for all 0 < i < c. 

Now we give a detailed description of the algorithm of Section 12.41 We construct a recursive function 
Defend which inputs a triple ( U(i,N),D,e ), where U(i,N ) is the set of nodes to be defended, D is the 
number of steps available for defending U(i,N), and e is an expression which represents the “weight” of 
the constructed defending strategy in the final distribution v. The procedure outputs a pair (0, V ) where 
0 is an expression specifying a D-modular strategy for U(i,N), and V is an arithmetic expression repre¬ 
senting the guaranteed “coverage” of the targets in U(i,N) when using 0 with the weight e. As a side 
effect, the function Defend may produce equations for the employed variables. The algorithm is invoked 
by Defend( U(l, \U |), d, 1), and the system of equations is initially empty. A call Defend( U (/, N), D, e) is 
processed as follows: 

• If D | N and N = k ■ D, then 0 = Circle(U(i,N),k, 1). Observe that every node of U(i,N ) is visited at 
most once in D steps, and this happens with probability D/N. If the weight of 8 is e, then this probability 
becomes e ■ (D/N) (since N and D are constants, the expression V - e ■ (D/N) is parameterized just by 
the variables of e). Hence, the function returns the pair ( 8 , V). 

• If N | D and D = k-N, then 8 = CircIe(U(i,N), 1, k). Every node of U(i,N) is visited precisely k times in 
D steps. If the weight of 8 is e, then the probability of visiting a given node in D steps is V = 1 - (1 - e) k . 
The function returns the pair ( 8 , V). 

• If N > D, D \ N, and N = k ■ D + c where 1 < c < D, then we split the set U(i,N ) into disjoint subsets 
U(i,k ■ D) and U(i + k ■ D,c) with precisely k ■ D and c elements, respectively. Then, we pick a fresh 
variable p and issue two recursive calls: 

(0i, Vi) = DEFEND((7(/,k • D),D,( 1 - p) ■ e), ( 82 , V 2 ) = Defend(£/(/ + k ■ D,c),D,p ■ e) 

The set of equations is enriched by Vj = VV That is, we require that p is chosen so that the nodes of 
U(i , k ■ D) and U(i + k ■ D, c > are protected equally well. Then, we put 8 = v p { 8 \, 82 ] and we set V - V\. 
The function returns the pair (0, V). 

• Finally, if D > N, N { D, and D = k ■ N + c where 1 < c < N, we issue two recursive calls: 

(0i, Ei) = Defend (U(i, N), k ■ N, e), (0 2 , V 2 ) = Defend (U(i, N), c, e) 

This is perhaps the most subtle part of our algorithm. Here we do not split the set U(i,N), but the 
number of steps available to protect U(i,N). Intuitively, the constructed strategy 0 first tries to loop over 
the targets of U(i,N ) as long as possible (i.e., for the first k ■ N steps). This is what 0i does. Then, 0 
tries to exploit the remaining c steps in the best possible way, i.e., by employing 0 2 . That is, we put 
0 = 0i;0 2 . If the weight of 0 is e, then the targets of U(i,N) are protected with probability at least 
V = 1 - (1 - Ei)(l - V 2 ). The function returns the pair (0, V). 


F The existence of a characteristic subdigraph 

In this section we prove the non-trivial direction of Theorem l2.71 i.e., we show that if Q = (U, T, u, E, d) is a 
patrolling problem with T = U, a well formed attack signature S , and a sufficiently connected environment, 
then Mj is (d-preserving isomoiphic to) a subdigraph of (U,E). 

Let us assume that E is sufficiently connected, and let <x be a defender’s strategy for Q such that val(cr) = 

( Tikesupp(s ) hf) ■ Due to Theorem 12.51 we obtain that cr is optimal, i.e., val = val(cr), and hence we can 
apply Proposition ID. ll to cr. 
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We reuse the notation introduced in the proof of Theorem 12. 5 1 In particular - , for all h e 'Flla) and i e No, 
we use Nndeh i : Rlh) —> U to denote a function which to every run hw e Rlh) assigns the node Wj. Further, 
we use Hh,i £ A (U) to denote a distribution defined by ///, ,(«) = P (T (Nodei 1 j=u)/‘P cr (R(h)). We start by 
realizing the following: 

Lemma F.l. For all h e Flla) and u e U, we have that Y^=q * Ph,il u ) — val. 

Proof For all h e Filer) and u e U wc have that 

d(u)~ 1 

^ PhM) > val Iast(h) l(r h ) - val 
i =0 

where the last equality is due to Proposition ID. II Now suppose that there exist some h e Filer) and u e U 
such that Y fig q/,,/(«) > val. Let € = HkesupptS) k. For every k e supplS), we put 

a[k] = XI Xi ■ 

ueU, d(u)=k i =0 

Obviously, YkesupptS) = A Further, for every k e supplS) we have that a\k\ > val-S(k)-j : . because other¬ 
wise there inevitably exists some 0 < i < l-k and u e U such that d(u) — k and Y'J^f "^' dhf 11 ) < val, which 

means that there exists hh' e Filer) such that \h'\ = i and Yj^q - 1 Phi/jlu) < val. Since vali ast (hh i )lo'hh 1 ) = val 
by Proposition ID. 11 we have a contradiction. 

Since a\k\ > val ■ S(k) ■ j for all k € supplS) and Ykesupp(S) a \k\ = L we obtain that a\k\ = val ■ S(k) ■ j 
for all k € supplS). Similarly, for every k e supplS), every u e U where dlu) = k, and every 0 < i < £ - k 
we must have that Y'+Jf "° 1 Ph.ilu) ^ val (otherwise we obtain contradiction in the way indicated above), 

which is possible only if p hi (u) = val for all such i and u. In particular, this holds for i = 0, and 

the proof is finished. □ 

Now we present a sequence of observations that reveal a certain form of periodicity in the structure of cr. 
The next lemma follows trivially from Lemma iFTl 

Lemma F.2. For all h e Filer) and u € U we have that cr(h)(u) < vallcr). 

Lemma F.3. Let li e Filer) where lastlh) = u. Then for every hh' e FI la) where \h'\ < dlu) we have that 
lastlh') ± u. 

Proof. Suppose the converse. Then there exist hh' e Flla) and a node u € U such that lastlh) = lastlh') = u 
and |/i'| < dlu). Due to Proposition ID. II we have that vaZ H (cr/ ! ) = val. Further, Y'l^j * dhjlu) = val by 
Lemma IFTl However, due to the existence of h' we obtain that valja/,) < Zfijf -1 Ph,i{u), which is a 
contradiction. □ 

Lemma F.4. Let h € Flla) where lastlh) = u. For all i > 0 and hh' e Flla) where \h'\ = i ■ dlu) + dlu) — 1 
we have that a(hh')(u) = valla) and u does not appear among the last dlu) - 1 nodes ofh'. 

Proof. By induction on i . In the base case (/ = 0), we have that u does not appear among the last dlu) - 1 
nodes of IT by Lemma lR3l Further, by Lemma IFTl and Lemma lR3l we obtain that valfah) = ph,d(u)~ i(w). 
Hence, ph,d(u)~ i(w) = val = valla). By Lemma lR2l this is possible only if for all lih' e 'Hla) where 
\IT\ = dlu) - 1 we have that alhlTfu) = valla). For the inductive step, consider hh'h" e 'Hla) where |/;'| = 


23 








i-d(u)+d(u) - 1 and \h"\ = d(u). By applying induction hypothesis to lilt', we obtain that cr(hh')(u ) = val(cr). 
If u was revisited in the last d(u) - 1 nodes of h", we would have Z-Iq* ' dhh’,i(u) > val, which contradicts 
Lemma IrTI If cr(hh'h")(u) < val(cr), we obtain Z^ -1 dhh'u'j( u ) < val, where u' is the first node of h", 
which again contradicts Lemma IrTI □ 

Lemma F.5. Let hh' e FI(cr) where lastih ) = lastih') = u. Then d(u) divides \h'\. 

Proof. Directly from Lemma lR4l □ 

Lemma F.6. Let h e 74(cr) where lastih) — u. For every i e N, there exist hh' e Flier) such that \h'\ = i■ d(u) 
and lastih’) = u. 

Proof Immediate. □ 

For the rest of this section, let us fix a history h = uq • ■ • u m e 'Flier) such that every node of U appears 
in h (such an h must exist). For every u e U, let us fix some j < m such that uj = u, and let ojfset{u) = 
j - • d{u). Note that due to Lemma lR5l the definition of offset{u ) is independent of the concrete choice 

of j. For every k e supp{S ) and every i e {0,... ,k - 1}, let I4W be the set of all nodes u e U such that 
diu) = k and offset(u) = i. 

Lemma F.7. Let k, k' e supp{S ), i e {0,..., k— 1}, i' € JO,..., k'—l}, and 0 < l <k - k! where i = (mod k 
and i' — i+\ mod k'. Then for all u e \f\i\ and u' e 14' I/] we have that ( u , u') e E. 

Proof. Due to Lemma lR6l there exist hh' e ’Hicr) and hh" e Flier) such that \h’\ = l, \h”\ = l + I, 
lastih') = u, and lastih") = u'. By Lemma lR4l we obtain cr(hh')(u') = val, which means iu, u') e E. □ 

Lemma F.8. For all k € suppiS) and i e {0 ,... ,k - 1 j, the set 14M contains exactly Sik)/k nodes. 

Proof. By applying Lemma IRTI □ 

Due to Lemma lR8l we have that for all k e suppiS) and i e (0 .... ,k - I}. the set 14FI has exactly 
Sik)/k elements, which we denote by v*[i, 1],..., v*[i, Sik)/k]. Due to Lemma lR7l for every pair of nodes 
VkU-j] and v k ’\i',j'\, such that i = Imodk and i' = i£+\)modk' for some 0 < t < k ■ k' we have that 
(v k \i, j], ly [/', /]) € E. Hence, iU, E) contains a subdigraph which is 4-preserving isomorphic to M$. 

G Complexity of finding the characteristic subdigraph 

In this section we prove two claims leading to combined Theorem 12.81 via Theorem 12.71 We will focus on 
a subclass of patrolling problems Q = ( U, T, u, E, d) such that T = U, suppiS) = {&}. In such a case, for a 
well-formed attack signature S, we have that \U\ = n is divisible by k and that the characteristic digraph M s 
has a particularly nice description: M s has a node set liq, ... u n -\ and UjUj is an arc iff j = (/ + 1) mod k. 

Our proofs will actually be expressed in terms of a special equitable k-colouring of the complementary 
digraph El = iU,E) of the environment E (i.e., H having precisely those arcs, but not the loops, which 
are absent in E): Let 11/| = |R(//)| = a ■ k. The task is to find a colouring c : V(H) —> {1,2,... ,k) of the 
node set such that (a) |c _1 (/)| = a for each i = 1 ,... ,k, and (b) no arc xy of H receives colours c(x) - j, 
c(y) = ij mod k) + 1 for some ye (while both x, y might receive the same colour). Comparing this 

with the definition of M s one immediately concludes that fU, E) contains a subdigraph isomoiphic to M s if, 
and only if, the complement H has a special equitable k-colouring. 
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Lemma G.l. For a simple digraph FI on an even number of nodes, one can find in polynomial time a special 
equitable 2 -colouring of H, if it exists. 

Proof Note that our definition of a special 2 -colouring does not allow for arcs having two distinct colours 
on their nodes, in either order. Hence every weak component of FI must be monochromatic (recall that a 
weak component is a connected component of the underlying undirected graph of H). The problem thus 
reduces to finding a subset of weak components of H summing to exactly half of the nodes of H. This we 
solve in polynomial time by two folklore algorithms; finding the weak components by BFS, and solving the 
knapsack problem in unary notation by standard dynamic programming. □ 

Lemma G. 2 . Let k e N, k > 3 . Assume a simple digraph H such that \ V(H)\ is divisible by k. Then it is 
NP-complete to decide whether H has a special equitable k-colouring. 

Proof First to say, there does not seem to be an easy way how to reduce a case of k > 3 to that of k + 1 , 
and so we have to provide hardness reductions for each considered value of k. We reduce from the folklore 
NP-complete problem of two-colouring 3 -uniform hypergraph: Given is a ground set X and a family F of 
3 -element subsets of X (hyperedges). The task is to decide whether the elements of X can be assigned one 
of two colours each such that no set in F is monochromatic. 

(k = 3 ) For such a 3 -uniform hypergraph (X, F) we first construct an equivalent instance H of the 
special equitable 3 -colouring problem. Let a = 3 \F\ + |X|.@ We denote by A3 the digraph of a' = a + \F\ 
nodes s\,S2,..., V and of a' - 1 arcs siSj for i = 2 ,..., a' (A3 is a star), and by B the digraph on a- \F\ nodes 
with no arcs at all. Then we construct a digraph G3 on the node set X U F' where F^ is a set containing 
exactly three distinct copies of each hyperedge / e F. The arcs of G3 are given as follows; for 

each / = {xi,*2,-Ts} £ F there is a directed 6-cycle on the nodes x\, f, X2, f', X3, f” in this cyclic order (a 
permutation of x\, X2, X3 is irrelevant, though). A digraph H is constructed from the disjoint union of A3, B 
and G3, by adding arcs from the node 5] to all the nodes in X of G3. 

Then H has exactly a + \F\ + a - \F\ + |X| + 3 \F\ = 3 a nodes, and we claim that H has a special equitable 
3 -colouring if, and only if, (X, F) is two-colourable. In the forward direction, up to symmetry between the 
colours, we may assume that 33 gets colour 1 , and so all nodes of A3 have colours 1 or 3 . We argue the 
following properties: 

(i) The nodes in X can only receive colours 1 , 3 . 

(ii) Among the nodes of G3 not in X, at least \F\ of them must receive colour 2 . 

Here (i) follows from the fact that each node in X ends an arc starting in 33 (of A3) of colour 1 . To get 
(ii), notice that we have to assign colour 2 to exactly a nodes which cannot appear in A3 and in X due to 3i 
having colour 1 . We can give colour 2 to the nodes of B , yet, at least a - \B\ = \F\ of the nodes of colour 2 
must be in G3 \ X. 

Now we prove that if (i),(ii) hold true, then the hypergraph (X, F) is two-colourable. Consider one of 
the 6-cycles of G3, say the one on the nodes x\, f, X2, f', X3, f". It cannot happen c{f) = c(/') = 2 —in 
such a case, depending on the colour c{x 2), there would be an arc in G3 coloured with a forbidden pair 1,2 
or 2 , 3 . Hence each of the 6-cycles defining the arcs of G3 (for each / = (33, 32, 33! e F) has at most one 
vertex of colour 2 , and so exactly one such. Up to symmetry, let c(/) = 2 in (any) one of the cycles. Then 
c(x 1) + c(x 2), since otherwise c(x \) = c(xf) £ { 1 , 3 ! would again give a forbidden pair of colours 1,2 or 2, 3 , 
respectively. Consequently, taking the colouring c restricted to X, no hyperedge in F is monochromatic and 
(X, F) is two-colourable. 

2 Although the formula for a might seem arbitrary now, this precise expression will become relevant with the case of k = 6. 
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Conversely, consider a two-colourable 3-uniform hypergraph (A, T). Let the colours occuring in A be 
1 and 3. We extend this to a special equitable 3-colouring of our digraph H as follows. If a hyperedge 
/ = {.xi,X2,X3} e T is coloured 1,1,3, then we assign colours 1,1,1,3,3,2 in order to the 6 -cycle on 
the nodes x\, f, X 2 , f', x?,, f" in G 3 . If this / = [xi,X 2 ,X 3 ) e T is coloured 1,3,3, then we assign colours 
1,1,3,3,3,2 in order to the same 6 -cycle. We finally assign colour 1 to ,V], colour 2 to all nodes of B (and 
so c -1 (2) = a), and an arbitrary choice of colours 1,3 to the remaining nodes of A 3 in order to “balance” 
c _ 1 (l) = c _ 1 (3) = a. 


(k = 4) Second, we modify the previous construction of H for the case of k = 4. We use the same B 
and G 3 . We replace A 3 with a digraph A 4 which is the complete digraph on a' = a + \T\ nodes s\,S 2 ,..., s a >, 
too. Then we add a new digraph C 4 formed by a nodes with no arcs between. H is constructed from a 
disjoint union of A 4 , C 4 , B and G 3 by adding all the arcs from .v 1 to C 4 and all the arcs between the nodes of 
A 4 and of X c V(Gt,) in both directions. Clearly, H has 4 a nodes. 

Consider a special equitable 4-colouring of H. Again, up to symmetry, let the colour of .V| be 1. Then 
whole A 4 and X may only receive colours 1 or 3 and (i) holds true again. Since no node of C 4 may be 
coloured 2 due to the existence of an arc from ,V|, and since B (which may be coloured by 2) has size a - \F\, 
we get (ii), too. Now, notice that the argument following (i),(ii) above did not use the pair of colours 3,1 as 
forbidden, and so it applies now as well; (X, T) is two-colourable. 

Conversely, consider a two-colourable 3-uniform hypergraph ( X , T). Then, exactly as in the case of 
k - 3, we get a valid colouring of A 4 U B U G 3 which we complement by assigning colour 4 to whole C 4 . 
This results in a special equitable 4-colouring of H. 

(k > 5) Third, we define a general construction for all the values k = 5,6.We use the same gadgets 

G 3 , B, and A 4 , and introduce k- 3 disjoint copies of C 4 which we denote by C 4 , C 5 and D 5 ,_A.- 1 - Again, 

on the disjoint union of all these digraphs (which has k • a nodes) we define H by adding 

• all the arcs between the nodes of A4 and of D5 U • • • U D^-i in both directions, 

• all the arcs between the nodes of A 4 U D 5 U • • • U D&_i and the nodes X of G 3 in both directions, 

• all the arcs between the nodes of D 5 U • • • U Dk~\ and of B in both directions, 

• all the arcs between .s 1 and the nodes of C 4 in both directions, and the same between S 2 and C 5 . 

Consider a special equitable ^-colouring of H. For simplicity we call the forbidden pairs of colours 

j, (j mod k) + I as adjacent. Since A 4 U D 5 U • • • U D/ { _ 1 has (k - 4)a + 1 nodes, at least k - 3 distinct colours 
must occur there. However, A 4 (of > a nodes) itself gets at least two distinct non-adjacent colours ci,C 2 
which cannot be adjacent to any of the colours occuring in D 5 U • • • U \ other than C\,C 2 - A simple case 
analysis shows that the only valid choice of colours is ci = 1, C 2 = 3 and remaining 5, 6 ,... ,k - 1, up to 
rotation symmetry. Consequently, A 4 holds only colours 1,3 and each of the colours 5,6,.... k - 1 occurs 
somewhere in D 5 U • • • U Dk~\. In particular, no node of A 4 U D 5 U • • • U Dk~\ is coloured 2. 

Which nodes could have colour 2? Due to the arcs to and from ,?i, S 2 in A 4 , all the nodes of colour 2 
belong to B U (G 3 \ A), and since G 3 \ X has 3>\T\ < a nodes, we have c _1 (2) n B + 0. This has a twofold 
consequence; first, (ii) holds true also in this case, and second, colours 1,3 cannot occur in D 5 U • • • U D*_i. 
Then, by simple counting, c -1 (5) U • • • U c~ l {k - 1) must be exactly the node set of D$ U • • • U D^~ 1 , and 
hence X cannot get any of the colours 5,... ,k- 1. Neither colours 2,4 or k could occur in X due to the arcs 
to and from A 4 , which concludes that (i) holds true, too. Theorefore, (A, T) is two-colourable. 

Conversely, consider a two-colourable 3-uniform hypergraph (A, T). We colour G 3 U A 4 by 1,2,3 as 
above while giving c(s 1 ) = 1 and cGc) = 3. Then we assign colour 2 to whole B. colour 4 to whole C 4 , 
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colour k to whole C 5 , and colours j to whole Dj for j - 5,..., k- 1. Again, this results in a special equitable 
^-colouring of H. □ 
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